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DNA Se quences Encoding the Human Al. A2a and A2b 

Adenosin e Receptors 
Field-of the Invention 

The present invention relates to DMA sequences 
5 encoding the human Al f A2a and A2b adenosine receptors. 
In addition, the present invention relates to the use of 
these DNA sequences in the production of the human Al, A2a 
and A2b adenosine receptors using recombinant DNA 
technology. 

10 Background of the Invention 

Adenosine influences cardiovascular function (by 
slowing heart rate and decreasing blood pressure) and also 
influences nervous system function (through sedative and 
anti-epileptic effects). In addition, adenosine can 

15 induce bronchoconstriction. Adenosine binds specifically 
to at least three receptors, Al and A2a and A2b. 
Adenosine receptors have been shown to couple to a number 
of second messenger systems. Additional adenosine 
receptor subtypes may exist. As adenosine receptor 

20 agonists and antagonists may have commercial value as 

anti-hypertensive agents, hypnotics, anti-psychotics end 
bronchodilators , the ability to produce adenosine 
receptors by recombinant DNA technology is advantageous. 
The present inventors have isolated three related 

25 cDNA fragments encoding the human Al, A2a and A2b 

adenosine receptors from human hippocampal cDNA by using 
either the. polymerase chain reaction and unique degenerate 
oligonucleotides to generate specific probes or by using 
specific consensus oligonucleotide probes for cDNA library 

30 screening. Full-length cDNA clones for each of the three 
receptors were isolated from a human hippocampal cDNA 
library. The receptor sequences were identified as the 
human Al, A2a and A2b adenosine receptors by expression in 
mammalian cells and both measurement of the affinity of 

35 the encoded receptors for various adenosine analogues and 
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the effect of receptor activation on cAMP synthesis. The 
receptors have homology to cDNA's encoding the dog Al and 
A2a adenosine receptors (MAENHAUT, C, VAN SANDE, J. , 
LIBERT, F., ADRAMOWIC, M., PARMENTIER, M. , 
5 VANDERHAEGEN, J., DOHONT, D. , VASSART, G. AND 

SCHIFFMANN, S. (1990); LIBERT, F., SCHOFFMANN, S.M., 
LEFORT, A., PARMENTIER, M. , GERARD, C, DUMONT, J.E., 
VANDERHAEGHEN J.J., VASSART, G. (1991)) and the rat A2b 
adenosine receptor (STEHLE, J.H., RIVKEES, S.A., 
10 LEE, J.J., WEAVER, D.R., DEEDS, J.D. AND REPPERT, S.M. 

(1992)). These hippocampal cDNA sequences represent novel 
human receptors which may be of clinical and commercial 
importance . 

Summary of the Invention 

15 Accordingly, in a first aspect the present invention 

consists in a DNA molecule encoding the human Al adenosine 
receptor, the DNA molecule having a sequence substantially 
as shown in Figure 1 or a functionally equivalent sequence. 
In a second aspect the present invention consists in 

20 a DNA molecule encoding the human A2a receptor subtype, 
the DNA molecule having a sequence substantially as shown 
in Figure 2 or a functionally equivalent sequence. 

In a third aspect the present invention consists in a 
DNA molecule encoding the human A2b adenosine receptor 

25 subtype, the DNA molecule having a sequence substantially 
as shown in Figure 3 or a functionally equivalent sequence 

As used herein the term -functionally equivalent 
sequence" is intended to cover variations in the DNA 
sequence which, due to degeneracy of the DNA code, do not 

30 result in the sequence encoding a different polypeptide. 
Further, this term is intended to cover alterations in the 
DNA code which lead to changes in the encoded polypeptide, 
but in which such changes do not affect the biological 
activity of the polypeptide. 

35 As used herein the term "DNA molecule- is intended to 
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cover both genomic DNA and cDNA. 

In a fourth aspect the present invention consists in 
a method of producing the human Al adenosine receptor 
comprising culturing a cell transformed with the DNA 
5 molecule of the first aspect of the present invention 
under conditions which allow expression of the DNA 
sequence such that the human Al adenosine receptor is 
expressed on the cell surface and optionally recovering 
the human Al adenosine receptor. 

10 In a fifth aspect the present invention consists of a 

method of producing a human A2a adenosine receptor 
comprising culturing a cell transformed with the DNA 
molecule of the second aspect of the present invention 
under conditions which allow expression of the DNA 

15 sequence such that the human A2 adenosine receptor is 
expressed on the cell surface and optionally recovering 
the human A2a adenosine receptor. 

In a sixth aspect the present invention consists of a 
method of producing a human A2b adenosine receptor 

20 comprising culturing a cell transformed with the DNA 
molecule of the third aspect of the present invention 
under conditions which allow expression of the DNA 
sequence such that the human A2 adenosine receptor is 
expressed on the cell surface and optionally recovering 

25 the human A2b adenosine receptor. 

In further aspects the present invention consists of 
a method of screening a molecule for adenosine agonist or 
antagonist activity , comprising contacting the molecule 
with the human Al, A2a or A2b adenosine receptors produced 

30 by the method of the fourth, fifth or sixth aspect of the 
present invention. 

In yet a further aspect the present invention 
consists in oligonucleotides 305, 377 and 376 as 
hereinafter described. 
35 The DNA molecules of the present invention represent 
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novel human receptors. These receptors may be of interest 
both clinically and commercially as they are expressed in 
many regions of the body and as adenosine affects a vide 
number of systems, 
5 The isolated full-length DNA clones containing the 

complete coding region for these receptors can be used to 
establish mammalian cell lines producing the receptors for 
use in agonist and antagonist screening. The receptor DNA 
sequence can be used for additional homology screening to 
10 identify novel members of this receptor family. 

In order that the nature of the present invention may 
be more clearly understood preferred forms thereof will 
now be described with reference to the following examples 
and figures in which 
15 Figure 1 shows the nucleotide and amino acid sequence 

of the human Al adenosine receptor cDNA. 

Figure 2 shows the nucleotide and amino acid sequence 
of the human A2a adenosine receptor cDNA* 

Figure 3 shows the nucleotide and amino acid sequence 
20 of the human A2b adenosine receptor cDNA. 

Figure 4A shows saturation isotherms of the total 
(unfilled triangle), specific (filled circle) and 
non-specific (unfilled square) binding of the Al adenosine 
receptor antagonist DPCPX (8-cyclopentyl-l,3 
25 dipropy lxanthine ) to mammalian CHO.K1 cells expressing the 
human Al adenosine receptor. 

Figure 4B shows competition binding curves showing 
the displacement of CGS-21680 

2-p- ( 2-Carboxyethyl ) phenethy lamino-5 • -N-ethylcarboxyamido 
30 adenosine hydrochloride) by different adenosine agonists 

and antagonists (NECA « 5 1 -N-ethylcarboxamido adenosine; 

CA= 2 -chloroadenos ine ; CPA=N 6 -eye lopenty ladenos ine ; 

XAC=xanthine amine congener; T«8-(p-sulphophenyl)- 

theophylline) in mammalian HEK 293 cells expressing the 
35 human A2a adenosine receptor. 

■ 
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Pigure 5 shows the effects of the different adenosine 
receptor subtypes , Al, A2a and A2b upon cyclic AMP 
production. Al adenosine receptor activation leads to 
inhibition of forskolin stimulated cAMP levels. 
5 Activation of both the A2a and A2b adenosine receptors (by 
CGS-21680 and NBCA, respectively) leads to stimulation of 
cAMP levels . 

METHODS 

Oligonuc leotide Design and Synthesis 

10 Unique degenerate oligonucleotides corresponding to 

the transmembrane II (TM II) and IV (TM IV) regions of G 
protein-coupled receptors and containing either a 5' EcoRI 
restriction enzyme site (TM II oligonucleotide 377) or a 
3' Hind III restriction enzyme site (TM IV 

15 oligonucleotides 305 and 376) were synthesized on an 
Applied Biosystems automated DMA synthesiser. The 
sequences of the oligonucleotides are as followsx- 

305 5 ' - CCCAATAAGCTTAG I CC IATGGCGAAAGACAGGACCCA- 3 ' 
20 A A G G C 

A A 

376 5' - GAGTCCGAAGCTTAGTGGGCAAGAGATGGCGAAIGAIAGIACCA-3 ' 

G TA C A G 

25 T A 

377 5* - CAGAACGAATTCAATGTTTTTATGTGGTCTTTGTCITC 1 

C GG GGGC 

A 

30 

The DNA sequences included inosine (I) residues. 
Crude oligonucleotides were then used in the polymerase 
chain reaction. 
PCR Amplification 

35 Sequences homologous to the G protein-coupled 
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receptor oligonucleotides were amplified from human cDNA 
using PCR and the Bybaid thermocycler . DNA was prepared 
from -a human neuroblastoma (Clontech) cDNA library in 



m 




Ttl 


•> 



5 library in lambda ZapII. DNA was prepared by phenol and 

P 

chloroform extraction of approximately 10 library phage 
and ethanol precipitation to recover the DNA. DNA from 
the cDNA libraries (l-5jig) was incubated with 200pM of 
each dNTP, O.SjiM oligonucleotide, 0.5 units Tth enzyme 
10 (Toyobo) in 50mM KC1, 50mM Tris-HCl pH9.0, 1.5mM MgClj 

(1 x PCR buffer) in a 50pL reaction volume. Samples were 
layered with 50jiL light mineral oil (Sigma). 
Reactions were denatured for 5 minutes at 95°C. The PCR 
conditions were as follows: Denaturation for 2 minutes at 
15 92°C, annealing for 2 minutes at 55°C, and extension 
for 2 minutes at 92°C, 2 minutes at 50°C, and 
2 minutes at 70°C, repeated five times; theft 2 minutes 
at 95°C, 2 minutes at 45°C r and 2 minutes at 70°C, 
repeated thirty times. 
20 Subclonino and Sequencing of Amplified DNA Fragments 

Amplified DNA (20pl) was removed and analysed by gel 
electropheresis in 1% agarose and 3% NuSieve (SeaKem). 
Amplification products 260bp-330bp in length were excised 
from the gel and purified with Geneclean. 
25 DNA fragments were then digested with Hind III for one 



hour at 37°C and EcoRI for one hour at 37°C, the DNA 
again purified with Geneclean and eluted into lOjil 
H 2 0. Digested DNA fragments were then subcloned into 
M13mpl9 and sequenced by the Sanger dideoxy 
30 chain- termination method using the Pharmacia or the 
Promega DNA sequencing kit. Sequencing reactions were 
analysed on a 6% acrylamide, 7H urea gel, dried onto 
Whatman 3M paper, and exposed to X-ray film for sixteen 
hours (Kodak X-OMAT AR5) at room temperature overnight. 
35 Sequence Analysis of Novel DNA 
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* 

Sequence analysis of the DNA fragments generated from 
the PCR amplification identified two DNA fragments that 
had sequences common to other known G protein-coupled 
3&£eptors. PCR amplification of neuroblastoma cDNA with 
5 thfti4egenerate oligonucleotides 377 and 305 produced a 

cDNfcj-feragment which was designated 3.1. PCR amplification 
of huflOft hippocampal cDNA with the degenerate 
oligonucleotides 377 and 376 produced a cDNA fragment with 
a sequence that was 76% homologous at the nucleotide level 

10 to sequence 3.1 and was designated 3.2 The DNA sequences 
were searched on the GenBank and EMBL databases for 
comparison to known sequences and were confirmed to be 
noyel sequences with a high level of homology to dog 
adenosine Al and A2 receptors. 

15 Isolation of Pul l-Lenoth cDNA Clones 

Full-length cDNA clones encoding t,he Al receptor as 
well as receptor sequences corresponding to 3.1 and 3.2 
were isolated from a human hippocampal cDNA library 
(Stratagene) • 

20 Al adeno sine receptor cDNA isolation 

Specific consensus oligonucleotides corresponding to 
the second extracellular loop (679), and to the third 
intracellular loop (678) were synthesised on ah Applied 
Biosys terns automated DNA synthesiser. The sequences of 

25 the oligonucleotides are as follows :- 

678 5 - CCO^AGTACTTCTGCGGGTCGCCAGAGGAGGCGACACCT 1 

679 5 • -GAGGCGCAGCGGGCCTGGGCGGCCAACGGCAGCGGCGGCGAGCCCGTG- 3 1 
30 Approximately 5 x 10 5 plaques were plated on C600 

HflA bacterial cells. Plaques were lifted on to 
Hybond-N+nylon filters (0.45/iM, 137mm, Amersham) . DNA 
was denatured on the filters with a 3 minute incubation on 
0.5 M NaOH, 1.5M NaCl and neutralised with a 5 minute 
35 incubation in 0. 511 Tris pH72, ImM EDTA and 1.5M NaCl. DNA 
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was fixed to the filters with a 15 minute exposure to 0.4M 
NaOH. Filters were then rinsed in 2 x SSC (3H NaCl, 0.3H 
sodiuih citrate) and allowed to dry before a 30 minute 
prehybridisation in 40% formamide, 5 x SSC, 5 x 
5 Denhardt's, 50mM NaPO^, 0.5% sodium dodecyl sulphate 
(SOS), O.lmg/ml salmon sperm DNA at room temperature. 
Oligonucleotides 678 and 679 were pooled and 50 pmoles 
total were radiolabelled using -ATP and the DNA 5* 
end-labelling system (Promega) . The filters were 

10 hybridised with this radiolabelled probe overnight at 
42°C, after which time they were washed once briefly in 
2 x SSC at room temperature then twice for 10 minutes each 
wash in 2 x SSC, 0.1SDS at room temperature with a final 
wash in 0.1 x SSC, 0.1%SDS for 15 minutes at 50°C. The 

15 filters were then exposed to Kodak X-0MAT AR5 film 
overnight at -70°C. Over twenty pure phage isolates 
which hybridised to the radiolabelled 678 and 679 
oligonucleotides were obtained. Several of these 
different cDNAs were sequenced. The sequence of one such 

20 cDNA (together with the deduced amino acid sequence) which 
encodes the human Al adenosine receptor is shown in 
Figure 1. 

A2a and A2b adenosine receptor cDNA isolation 

Approximately 1 x 10^ plaques were, plated op 

25 CSOOHflA bacterial cells. Plaques were lifted onto 

Hybond-N nylon filters (0.45^K, 137mm, Amersham) . DNA 
was denatured on the filters with a 3 minute incubation on 
0.5M NaOH, 1.5M NaCl and neutralised with a 7 minute 
incubation in 0.5M Tris pH 7.2, ImM EDTA and 1.5M NaCl. 

30 Filters were rinsed in 2 x SSC (20 x SSC is 3M NaCl, 0.3M 
sodium citrate) and DNA fixed to the filters with a 5 
minute exposure to ultraviolet light (312nm). Filters 
were prehybridied in 5 x SSPE (5 x SSPE=0.5M NaCl, 0.05M 
NaH 2 P0 4 , 0.0005M EDTA, pH 7.7), 5 x Denhardfs (0.1% 

35 (w/v) bovine serum albumin, 0.1% (w/v) Ficoll, 0.1% (w/v) 
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polyvinylpyrollidone) , 0.5% sodium dodecyl sulphate (SDS), 
0.2mg/ml salmon sperm DNA at 65°C for 17 hours. The 
filters were hybridised with a radiolabeled probe 
corresponding to the PCR amplified DNA fragment encoding 
5 the 300 bp of 3.1 (labelled with (a- 32 P)-dCTP using the 
random primers DNA labelling system (Bethesda Research 
Laboratories)). Following hybridisation of the 
radiolabelled probe for 20 hours at 65°C, filters were 
washed with 2 x SSPE, 0.1% SDS at room temperature for 10 

10 minutes, then with 1 x SSPE, 0.1% SDS at room temperature 
for 10 minutes and exposed to Kodak X-0MAT AR5 film for 
seven days at -70 C. Two pure phage isolates were 
hybridised to the radiolabelled 3.1 DNA fragment were 
obtained. The two DNA inserts were excised form the phage 

15 vector using EcoRI digestion and subcloned into M13mpl9 

for sequencing. Sequence analysis indicated that one cDNA 
insert of approximately 2.6 kilobases encoded the 
full-length clone for the 3.2 receptor. The sequence of* 
the cDNA (together with the putative amino acid sequence) 

20 insert encoding the 3.1 receptor (the human A2a adenosine 
receptor) is shown in Figure 2 (together with the deduced 
amino acid sequence of the human A2a adenosine receptor) 
whilst the sequence of the cDNA insert encoding the 3.2 
receptor (the human A2b adenosine receptor) is shown in 

25 Figure 3 (together with the deduced amino acid sequence). 
Expression of the cloned Al. A2a and A2b adenosine 
receptors in mammalian cells 

Each cloned full-length cDNA was subcloned into a 
mammalian cell expression vector (pcDNAlneo for A2a and 

30 A2b and pRc/CMV for Al ( Invitrogen) ) in such a way as to 
direct expression of the encoded receptor portion. 

Mammalian cell lines (Chinese Hamster Ovary - CHO Kl 
or Human Embryonic Kidney - HEK 293) were independently 
transfected with the recombinant expression vectors and 

35 cell lines established which had stably integrated the 
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cloned receptor DNA. The stably trans fected cell lines 
were examined for their ability to bind a range of 
adenosine analogues as shown in Figure 4. Furthermore, 
the effect on cyclic AMP (cAMP) levels of receptor 
5 activation by adenosine agonists was examined as shown in 
Figure 5. 

These studies demonstrate that cDNA clone 3.1 encodes 
an adenosine A2a receptor, cDNA clone 3.2 encodes an 
adenosine A2b receptor and that the Al cDNA encodes an 

10 adenosine Al receptor. Generation of significant amounts 
of purified receptor protein, made possible by this 
invention, can be used as a tool to facilitate the design 
and chemical synthesis of highly specific agonists and 
antagonists for each receptor subtype. Knowledge of the 

15 primary sequence differences between the related receptor 
subtypes as determined by this invention provides crucial 
information for the design of receptor isubtype specific 
agonists and antagonists. 

It will be appreciated by persons skilled in the art 

20 that numerous variations and/or modifications may be made 
to the invention as shown in the specif ic embodiments 
without departing from the spirit or scope of the 
invention as broadly described. The present embodiments 
are, therefore, to be considered in all respects as 

25 illustrative and not restrictive. 
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CLAIMS J - 

1. A DNA molecule encoding the human Al adenosine 
receptor, the DNA molecule having a sequence substantially 
as shown in Figure 1 or a functionally equivalent sequence 
5 2. A DNA molecule encoding the human A2a receptor 

subtype, the DNA molecule having a sequence substantially 
as shown in Figure 2 or a functionally equivalent sequence 

3. A DNA molecule encoding the human A2b adenosine 
receptor subtype, the DNA molecule having a sequence 

10 substantially as shown in Figure 3 or a functionally 
equivalent sequence* 

4. A method of producing the human Al adenosine receptor 
comprising culturing a cell transformed with the DNA 
molecule as claimed in Claim 1 under conditions which 

15 allow expression of the DNA sequence such that the human 
Al adenosine receptor is expressed on the cell surface and 
optionally recovering the human Al adenosine receptor, 

5. A method of producing a human A2a adenosine receptor 
comprising culturing a cell transformed with the DNA 

20 molecule as claimed in Claim 2 under conditions which 
allow expression of the DNA sequence such that the human 
A2a adenosine receptor is expressed on the cell surface 
and optionally recovering the human A2a adenosine receptor 

6. A method of producing a human A2b adenosine receptor 
25 comprising culturing a cell transformed with the DNA 

molecule as claimed in Claim 3 under conditions which 
allow expression of the DNA sequence such that the human 
A2b adenosine receptor is expressed on the cell surface 
and optionally recovering the human A2b adenosine receptor 
30 7. A method of screening a molecule for adenosine 

agonist or antagonist activity, comprising contacting the 
molecule with the human Al, A2a and A2b adenosine 
receptors produced by the method as claimed in any one of 
Claims 3 to 6. 
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Sequence Range: 1 to 1290 



10 20 30 40 

• * * * 

CGC AGG ATG GX6 CTT GCC TOG TGC CCC TTG GIG OOC GTC TGC TGA TGI 

50 60 70 80 90 

# * # * * 

GCC CAG OCX GIG CCC GCC ATG CCG CCC TCC ATC TCA GCT TIC CAG GCC 

Met Pro Pro Ser He Ser Ala Phe Gin Ala> 

100 110 120 130 140 

* * * * * 

GCC TAC ATC GGC ATC GAG GIG CTC ATC GCC CTG GTC TCT GTG CCC GGG 

Ala Tyr He Gly He Gla Val Lea He Ala Lea Val Ser Val Pro Gly> 

150 160 170 180 190 

* * * * * 

AAC GTG CTG GTG ATC TGG GCG GTG AAG GIG AAC CAG GCG CTG CGG GAT 

Asa Val Lea Val lie Trp Ala Val Lys Val Asn Gin Ala Lea Arg Asp> 

200 210 220 . 230 240 

GCC ACC TTC TGC TTC ATC GTC TOG CTG GCG GTG GCT GAT GTG GCC GTG . 

Ala Thr Phe Cys Phe lie Val Ser Lea Ala Val Ala Asp Val Ala Val> 

250 260 270 280 

* # * . * 

GGT GCC CTG GTC ATC CCC CTC GCC ATC CTC ATC AAC ATT GGG CCA CAG 

Gly Ala Lea Val lie Pro Lea Ala lie Lea lie Asn lie Gly Pro Gln> 

290 300 310 320 330 

* * * * # 

ACC TAC TTC CAC ACC TGC CTC ATG GTT GCC TGT CCG GTC CTC ATC CTC 

Thr Tyr Phe Bis Thr Cys Lea Met Val Ala Cys Pro Val Lea lie Lea> 

340 350 360 370 380 

* * * * * 

ACC CAG AGC TCC ATC CTG GCC CTG CTG GCA ATT GCT GTG GAC CGC TAC 
Thr Gin Ser Ser He Lea Ala Lea Lea Ala He Ala Val Asp Arg Tyx> 

390 400 410 420 430 

* * * * * 

CTC CGG GTC AAG ATC GCT CTC CGG TAC AAG ATG GTG GTG ACC CCC CGG 

Lea Arg Val Lys lie Pro Lea Arg Tyr Lys Met val Val Thr Pro Arg> 

440 450 460 470 480 

* * • * * 

■ 

AGG GCG GCG GTG GCC ATA GCC GGC TGC TGG ATC CTC TCC TTC GTG GTG 

Arq Ala Ala Val Ala lie Ala Gly Cys Trp lie Leu Ser Phe Val Val> 

PIG.l 
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49 J 500 5X0 520 

<ay Leu Thr Pro Met Pbe Gly T*p Asa Asn Leo Ser Ala Val eiu Arg> 
530 540 550 



560 570 



Ala Trp Ala Ala Asa Gly Ser Met Gly Glu Pro Val lie Lys Cys Glu> 

59 2 €00 ao 620 

_ * * * # 

TO^^GTCATCAGCATGGW31MATCGIC1MTTCAACTTCTO 
Phe Glu Lys Val ne Ser Met Glu Tyr Met Val ^ S A^ PheS> 



630 «40 650 660 



670 



GIG TGG GTG CTG COC O0G CTT CTC CTC ATG GTC CTC ATC TAC CTG GAG 
Val Trp Val Leu Pro Pro Lea Leu Leu Met Val Lea lie Tyr Leu Glu> 

680 690 700 710 720 

* * • * * 

GXC TTC TAC CIA ATC OGC AAG CAG CTC AAC AAG AAG GTG TCG GCt TCC 
Val Phe Tyr Leu lie Arq Lys Gin Leu Asn Lys Lys Val Ser Ala Ser> 

730 740 750 760 

* * * * 

TCC GGC GAC CCG CAG AAG TAC TAT GGG AAG GAG CTG AAG ATC GCC AAG 

Ser Gly Asp Pro Gin Lys Tyr Tyr Gly Lys Glu Leu Lys lie Ala Lys> 

770 780 790 800 810 

* * * * * 

TCG CTG GCC CTC ATC CTC TTC CTC TTT GCC CTC AGC TGG CTG OCT TTG 

Ser Leu Ala Leu He Leu Phe Leu Phe Ala Leu Ser Trp Leu Pro Lea> 

820 830 840 850 860 

* * * * * 

CAC ATC CTC AAC TGC ATC ACC CTC TTC TGC CCG TCC TGC CAC AAG CCC 

Bis lie Leu Asn Cys lie Thr Leu Phe Cys Pro Ser Cys His Lys Pro> 

870 880 890 900 910 

* • * * * 

AGC ATC CTT ACC TAC ATT GCC ATC TTC CTC ACG CAC GGC AAC TCG GCC 
Ser lie Leu Thr Tyr lie Ala lie Phe Leu Thr His Gly Asn Ser Ala> 

PIG.l (cont'd.) 
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920 930 940 950 960 

* * # # # 

ATG AAC CCC ATT GIC TAT GCC TTC CGC ATC CA6 AA6 TTC CGC GTC ACC 
Pro He Val Tyr Ala Phe Arg lie Gin Lys Phe Axg Val Thr> 

970 980 990 1000 

* * * # 

AAG ATT TGG AAT GAC CAT TTC CGC TGC CAG OCT GCA CCT CCC 
Lys lie Txp Asn Asp Bis Phe Arg Cys Gin Pro Ala Pro Pro> 

1010 1020 1030 1040 1050 

* * * * * 

ATT GAC GAG GAT CTC CCA GAA GAG AGG CCT GAT GAC TAG ACC CCG CCT 

lie Asp Gin Asp Leu Pro Gin Gin Arg Pro Asp Asp +**> 

1060 1070 1080 1090 1100 

* * * * * 

TCC GCT CCC ACC AGC CCA CAT CCA GTG GGG TCT CAG TCC ACT CCT CAC 

1110 1120 1130 1140 1150 

* * * * ' * 

ATG CCC GCT GTC CCA GGG GTC TCC CTG AGC CTG CCC CAG CTG GGC TGT 

1160 1170 1180 1190 1200 

* * * * * 

TGG CTG GGG GCA TGG GGG AGG CTC TGA AGA GAT ACC CAC AGA GIG TGG 

1210 1220 1230 1240 

* * * * 

TCC CTC CAC TAG GAG TTA ACT ACC CTA CAC CTC TGG GCC CTG CAG GAG 

1250 1260 1270 1280 1290 

* * * * * 

GCC TGG GAG GGA AGG GTC CTA CGG AGG GAC CAG GTG TCT AGA 



PIG.l (cont'd.) 
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Sequence Range: 1 to 



10 



20 



30 



40 



GAA TIT TCA GCT GTT CTT TGC TCA AXA AXA ACT ITT TTA TCA CCA AGA 



50 
• 



100 
* 

TTC TTA 
ISO 
AXA TCT 



60 
AAG TIT 
110 



70 



80 



TTG ACA TAT TCC TCA TTT GTT 



120 
* 



130 
* 



90 

• * 

TTG AXA AAA 
140 



TTT TCT TAG AAA AAT AAG TTA CTA AAA GTC AXA TAT CAT TGT 



160 170 
AAA TAT TGC TTA AAA CIA GGA 



180 



190 

CTT GIA TTT AAA TGT TTT 



200 210 
TTA AAG ACA ATT TGC 



220 



250 



260 



270 



230 
* 

GAA CCC TGA 
280 



240 



290 



340 
* 



TGA TGC TGC TGC GAG AAC 
300 310 

GTC CTC TGT GAA AAA 
350 360 



TGC AGA GGG OCT GGT 

320 330 

*. * 

CTT GGA GAG CGC CCC 
370 380 



AGG GCT GGA CTT GGC TCC TGT GAG GAA GGG GCT CAG GGG TCT GGG CCC 



390 



400 



410 



420 



430 



440 



450 
* 



460 



470 



480 



AAT GGA COG TGA GCT GGC CCA GGC CGC GTC CGT GCT GAG CCT GCC TGT 



490 

CTG TGG CC ATG 



540 
+ 

GCC ATT GCT GTG 

Ala lie Ala Val 



500 

AIC ATG GGC 

ne Met Gly 

550 

GCC ATC 
Ala lie 

FIG. 2 



510 520 
GIG TAC AIC 

val iyr lie 
560 570 



530 



Thr Val Glu> 



GGC AAT GIG CTG GIG TGC 

Gly Asn Val Leu Val Cys 
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580 590 600 610 620 

* * * * * 

GCC GIG TGG CIC AAC AGC AAC CIG CAG AAC GTC AOC AAC TAC TIT GIG 
Ala val Trp Leu Asn Ser Asn leu Gin Asa Val Thr Asn Tyr Phe Val> 

630 640 650 660 670 

* * * * * 

GIG TCA CIG GOG GOG GCC GAC ATC GCA GIG GGT GIG CTC GCC A3C CCC 

Val Ser Leu Ala Ala Ala Asp lie Ala Val Gly Val Leu Ala lie Pro 

680 690 700 710 720 

• * * * * 

TTT GCC ATC AOC ATC AGC AOC GGG TTC TGC GOT GCC TGC CAC GGC TGC 
Phe Ala lie Thr lie Ser Thr Gly Phe Cys Ala Ala Cys Bis Gly Cys> 

730 740 750 760 770 

* * * * * 

CTC TTC ATT GOO TGC TTC GTC CTG GTC CTC ACG CAG AGC TCC ATC TTC 

Leu Phe lie Ala Cys Phe val Leu Val Leu Thr Gin Ser Ser lie Phe> 

780 790 800 810 

* * * * 

AGT CTC CTG GCC ATC GCC ATT GAC CGC TAG ATT GCC ATC CGC ATC COG 

Ser Leu Leu Ala lie Ala lie Asp Arg Tyr lie Ala He Arg lie prt» 

820 830 840 850 860 

* * * * * 

CTC CGG TAC AAT GGC TTG GIG ACC GGC ACG AGG GOT AAG GGC ATC ATT 

Leu Arg Tyr Asn Gly Leu Val Thr Gly Thr Arg Ala Lys Gly lie ne> 

870 880 890 900 910 

* * •*..-* * 

GOO ATC TGC TGG GIG CIG TCG TTT GCC ATC GGC CTG ACT CCC ATG CIA 
Ala lie Cys Trp Val Leu Ser Phe Ala lie Gly Leu Thr Pro Met Len> 



930 940 950 960 

* * * * * 

GGT TGG AAC AAC TGC GGT CAG CCA AAG GAG GGC AAG AAC CAC TCC CAG 
Gly Trp Asn Asn Cys Gly Gin Pro Lys Gin Gly Lys Asn Bis Ser Gln> 

970 980 990 1000- 1010 

* * * * * 

GGC TGC GGG GAG GGC GAA GIG GCC TGT CTC TTT GAG GAT GTG GTC CCC 
Gly Cys Gly Glu Gly Gin Val Ala Cys Leu Phe Glu Asp Val Val Pro 

1020 1030 1040 1050 

* * + * 

ATG AAC TAC ATG GIG TAC TTC AAC TTC TTT GCC TGT GTG CTG GTG CCC 

Met Asn Tyr Met Val Tyr Phe Asn Phe Phe Ala Cys Val Leu Val Pro 

1060 1070 1080 1090 1100 

***** 

CTG CTG CTC ATG CTG GGT GTC TAT TTG CGG ATC TTC CTG GOG GCG CGA 
Leu Leu Leu Met Leu Gly Val Tyr Leu Arg lie Phe Leu Ala Ala Arg> 

1110 1120 1130 1140 1150 

* * * * * 

CGA CAG CIG AAG CAG ATG GAG AGC CAG OCT CTG COG GGG GAG CGG GCA 

Arg Gin Leu Lys Gin Met Glu Ser Gin Pro Leu Pro Gly Glu Arg Ala> 

FIG .2 (cont'd.) 
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HfiO H70 1130 1190 1M0 

* * * * 

COG TOC ACA CTG CAG AAG GAG GTC CAT GCT GOC AAG TCA CTG GCC AIC 

to Len Gin Lys Glu Val His Ala Ala Lya Ser Leu Ala ~- 



1210 



1220 1230 1240 1250 



AIT GIT GGG CTC ITT GOC CTC TGC TGG CTG OCC OCA CAC MC MC AAC 
neV*lGlyLeuP!*AULeuCysTrpLOTProLeuBisIleIle 

1260 1270 1280 1290 



CysPbeltorPhePbeCysProAspCysSer His Ala Pro Leu Trp Leuo 

1300 1310 1320 1330 1340 

ATOTACCIGGCCATCGICCIC1CCCACACCAAT TCG CTT CTG AM CCC 

Uetiyrlen Ala He Val Leo Ser Bis Thr Asn Ser val val Asa Pro> 
1350 1360 1370 1380 1390 



* 



TTC AIC TAC GCC TAC CGT AIC CGC GAG TTC CGC CAG ACC TTC CGC AAG 
Ffae He Tyr Ala Tyr Arq lie Axq Gin Phe Arq Gin Thr Phe Arq Lys> 

1400 1410 1420 1430 1440 

• * # * * * 

AIC AIT CGC AGC CAC GTC CTG AGG CAG CAA GAA OCT TTC AAG GCA GCT 
He He Arg Ser His Val Lea Arg Gin Gin Gin Pro Phe Lys Ala Ala> 

1450 1460 1470 1480 1490 

* * * ■ - i * * 
GGC ACC ACT GCC CGG GTC TTG GCA GCT CAT GGC AGT GTC GGA GAG CAG 
Gly Thr Ser Ala Arg Val Leu Ala Ala His Gly Ser Val Gly Gin Gln> 

1500 1510 1520 1530 

* * * • * 

GTC AGC CTC CGT CTC AAC GGC CAC CCG CCA GAG GIG TGG GCC AAC GGC 
val Ser Lea Arg Lea Asn Gly Bis Pro Pro Gin val Trp Ala Asn Gly> 

1540 1550 1560 1570 1580 

* * * ♦ * 

AST GCT CCC CAC OCT GAG CGG AGG CCC AAT GGC TAC GCC CTG GGG CTG 
Ser Ala Pro His Pro Gin Arg Arg Pro Asn Gly Tyr Ala Leu Gly Les> 

1590 1600 1610 1620 1630 

* * * * * 

GTG AGT GGA GGG AGT GCC CAA GAG TCC CAG GGG AAC ACG GGC CTC CCA 

val Ser Gly Gly Ser Ala Gin Gin Ser Gin Gly Asn Thr Gly Lea Pro 

1640 1650 1660 1670 1680 

* * * * * 

GAC GTG GAG CTC CTT AGC CAT GAG CTC AAG AGA GTG TGC CCA GAG CCC 

Asp val Gin Leu Leu Ser Bis Gin Leu Lys Arq val Cys Pro Gin Pro> 

1690 1700 1710 1720 1730 

• * # * * 

OCT GGC CXA GAT GAC CCC CTG GCC CAG GAT GGA GCA GGA GTG TCC TGA 

Pro Gly Leu Asp Asp Pro Leu Ala Gin Asp Gly Ala Gly Val Ser ***> 

1740 1750 1760 1770 

* * * * 

TGA TTC AIG GAG TTT GCC OCT TCC TAA G GGA AGG AGA TCT TTA TCT TTC 
*+* Phe Met Gin Phe Ala Pro Ser ***> 

FIG. 2 (cont'4-) 
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1780 1790 1800 1810 1820 

* * * * * 

TGG TTG GCT TGA CCA GIC ACG TTG GGA GAA GAG ASH GAG TGC CAS GAG 

1830 1840 1850 1880 1870 

* * • # • 

AOC CTG AGG GCA GCC GGT TCC TAG TTT GGA CTG AGA GAA GGG AGC COC 

1880 1890 1800 1810 1820 

* * • * * 

AGG CXG GAG CAG CAT GAG GCC CAG CAA GAA GGG CIT GGG TTC TGA Gtt 

1930 1940 1950 I960 1970 

* * • * • 

AGC AGA TGT TIC ATG CTG TGA GGC CIT GCA OCA GGT GGG GGC CAC AGC 

■ 

1880 1990 2000 2010 

* * * • 

AOC AGC AGC ATC TTT GCT GGG CAG GGC CCA GCC CTC CAC TGC AGA AGC 

2020 2030 2040 2050 ' 2080 

* • * • * 

ATC TGG AAG CAC CAC CIT GIC TOC ACA GAG CAG CTT GGG CAC AGC AA 



2070 2080 2090 2100 2110 

* * * * * 



2120 2130 2140 2150 2160 



2170 2180 2190 2200 2210 

* • * * # 

COC AGA GGT GAC ATT TGA CTT TTT TIC CAG GAA AAA TGT AAG TGT GAG 

2240 2250 



2270 2280 2290 2300 



2340 2350 
* * * * * 

CTC AGG CAG TCC TCT OCT GCT GIC ACA GCT GCC ATC CAC TTC TGA GIC 

2380 2370 2380 2390 2400 

* * • * • 

CCA GGG CCA TCT CTT GGA GIG ACA AAG CTG GCA TCA AGG ACA GGG AGT 

2410 2420 2430 2440 2450 
* * * * • 

TCT AAC AGA GCA GIG CCA GAG CAT GGG OCC AGG ICC CAG GGG AGA GGT 

2480 2470 2480 2490 

* * • * 

TGG GGC TGG CAG GCC ACT GGC ATG TGC TGA GTA GOG CAG AGC TAC CCA 

2500 2510 2520 2530 2540 

• * * « • 

CTG AGA GGC CTT GIC TAA CTG COT TIC CTT CIA AAG GGA ATG TIT TIT 
2550 2560 2570 

TCT GAG AIX AAA TAA AAA CGA GCC ACA G FIG . 2 (cont . d ., 
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Sequence Range: 1 to 1687 



10 20 30 40 

♦ * * * 

AGO COC GAG GCT CAG AAG CGG CAG GOG GAG GOG OGG TOO GGG OGC 

70 80 90 

* * * 



100 110 120 130 140 

* * * * * 

OCT TOG GXA GGG GGC GCC OGG GGC OCA GOT GGC OCG GOC ATG CTG GIG 



ISO 160 170 180 190 

* # * * * 

GAG ACA GAG GAC GOG CTG TAC GIG GOG CTG GAG CTO GIG ATC GOO GOG 

Gin Thr Gin Asp Ala Leu Tyr Val Ala Leu Gin Leu Val He Ala Ala> 

200 210 220 230 240 

* * * * * 

CTT TOG GIG GOG GGC AAC GIG CTG GIG TGC GOC GCG GIG GGC AOG GOG 

Leu Ser Val Ala Gly Asn Val Leu Val cy s Ala Ala Val Gly Thr Ala> 

250 260 .270 280 

■ * * * * 

AAC ACT CTG CAG ACG GGC ACC AAC TAC TTC CTG GTG TOO CTG GCT GCG 

Asn Thr Leu Gin Thr Pro Thr Asn Tyr Phe Leu Val Ser Leu Ala Ala> 

290 300 310 320 330 

* * * * * 

GCC GAC GTG GOC GTG GGG CTC TTC GCC ATC CCC TIT GOC ATC ACC ATC 

Ala Asp Val Ala val Gly Leu Phe Ala lie Pro Phe Ala lie Thr He> 

340 350 360 370 380 

* * *. * * 

AGO CTG GGC TTC TGC ACT GAC TTC TAC GGC TGC CTC TTC CTC GCC TGC 
Ser Leu Gly Phe Cys Thr Asp Phe Tyr Gly Cys Leu Phe Leu Ala Cys> 

390 400 410 420 430 

* * * * * 

TTC GIG CTG GTG CTC ACG CAG AGO TCC ATC TIC AGC CTT CTG GOC GTG 
Phe Val Leu Val Leu Thr Gin Ser Ser lie Phe Ser Leu Leu Ala Val> 

440 450 460 470 480 

* * * * * 

GGA GTC GAC AGA TAC CTG GCC ATC TGT GTC CCG CTC AGG TAT AAA AGT 

Ala Val Asp Arg Tyr Lets Ala He Cys Val Pro Leu Arg Tyr Lys Ser> 

490 500 510 520 

* * * * 

TTG GTC ACG GGG ACC CGA GCA AGA GGG GTC ATT GCT GTC CTC TGG GTC 

Leu Val Thr Gly Thr Arg Ala Arg Gly Val He Ala Val Leu Trp Val> 

FIG. 3 
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530 540 550 560 570 

* * * * * 

CJT GCC TTT GGC ATC GGK TTG ACT CCA TTC CTG GGG TGG AAC AST AAA 
lea Ala Phe Gly lie Gly Len Thr Pro Phe Leo Gly Trp Asa Ser Lys> 

580 590 600 610 620 

* * * * * 

GAC ACT GCC ACC AAC AAC TGC ACA GAA CCC TGG GAT GGA ACC AC6 AAT 
Ser Ala Thr Asa Asa Cys Thr Gin Pro Trp Asp Gly Thr Thr Asa> 

630 640 650 660 670 

* * * * * 

GAA AGC TGC TGC CTT GIG AAG TGT CTC TTT GAG AAT GTG GTC CCC ATG - 
Gin ser Cys Cys leu val Lys Cys Leu Phe Gin Asa Val Val Pro Met> 

* 

680 690 700 710 720 

* * # * * 

AGC TAC ATG GTA TAT TTC AAT TTC TTT GGG TGT GIT CTG CCC CCA CTG 
Ser Tyr Met Val Tyr Phe Asn Phe Phe Gly Cys Val Lea Pro Pro Leo> 

730 740 750 760 

* * * * 

CTT ATA ATG CTG GIG ATC TAC ATT AAG ATC TTC CTG GTG GCC TGC AGG 

Lea lie Met Lea Val lie Tyr lie Lys He Phe Lea Val Ala Cys Arg> 

770 780 790 800 810 

* • * * * 

GAG CTT CAG CGC ACT GAS CTG ATG GAC CAC TCG AGG ACC ACC CTC GAG 

Gin Lea Gin Arg Thr Gla Lea Met Asp Bis Ser Arg Thr Thr Lea Gln> 

820 830 840 850 860 

* * * * * 

CGG GAG ATC CAT GCA GCC AAG TCA CTG GCC ATG ATT GTG GGG ATT TTT 

Arg Gla lie Bis Ala Ala Lys Ser Lea Ala Met lie Val Gly lie Phe> 

870 880 890 900 910 

* * * * * 

GCC CTG TGC TGG TTA OCT GTG CAT GCT GTT AAC TGT GTC ACT CTT TTC 

Ala Lea Cys Trp Lea Pro Val Bis Ala Val Asn Cys Val Thr Lea Phe> 

920 930 940 950 960 

* * * * * 

CAG CCA GCT CAG GGT AAA AAT AAG CCC AAG TGG GOV ATG AAT ATG GCC 

Gin Pro Ala Gin Gly Lys Asn Lys Pro Lys Trp Ala Met Asn Met Ala> 

970 980 990 1000 

* * •* * 

ATT CTT CTG TCA CAT GCC AAT TCA GTT GTC AAT CCC ATT GTC TAT GCT 

Lea Ser Bis Ala Asn Ser Val Val Asn Pro lie Val Tyr Ala> 



1010 1020 1030 1040 1050 

* * * * 



AAC CGA GAC TTC CGC TAC ACT TTT CAC AAA ATT ATC TGC AGG 

Tyr Arg Asn Arg Asp Phe Arg Tyr Thr Phe Bis Lys lie ne Ser Arg> 

1060 1070 1080 1090 1100 

* * * * * 

TAT CTT CTC TGC GAA GCA GAT GTC AAG AGT GGG AAT GGT CAG GCT GGG 

TyrLeuLenCy?GlnAlaAspV*lLysSer Gly Asn Gly Gin Ala Gly> 

FIG. 3 (con^d.) 
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1U0 1120 1130 1140 



1150 



* * * * * 

G3A CAG OCT GCT C1C GGT GIG GGC CIA TGA TCT AGG CIC ICG OCT CIT 
V&l On Pro Ala Lett Gly Val Gly Lea ***> 

n6 ° H70 1180 1190 1200 

***** 
CCA GGA GAA GAT AQk AAT OA CAA GAA Ad AAG AG6 ACA CG6 CTG GTT 

1210 1220 1230 1240 
* * * * 

TTC ATT GTG AAA GAT AGC TAC ACC TCA CAA GGA AAX GGA CTG CCT CTC 

12S0 1260 1270 1280 1290 

* * * * * 

TTG AGC ACT TCC CTG GAG CTA CCA CGT ATC TAG CIA ATA TGT ATG TCT 

1300 1310 1320 1330 1340 

* * * * * 

GAG TAG TAG CAC CAA GGA TTG ACA AAX ATA TTT ATG ATC TAT TCA GCT 

1350 1360 1370 1380 1390 

***** 

GCT TTT ACT GTG TGG ATT ATG CCA ACA GCT TGA ATG GAT TCT AAC AGA 

1400 1410 1420 1430 1440 

* * * * * 

CTC TTT TGT TTT TAA AAG TCT GCC TTG TTT ATG GTG GAA AAT TAC TGA 

1450 1460 1470 1480 

* * * * 

AAC TAT TTT ACT GTG AAA CAG TGT GAA CIA TTA TAA TGC AAA TAC TTT 

1490 1500 1510 1520 1530 

* * * * . * 

TTA ACT TAG AGG CAA TGG AAA AAT AAA ACT TGA CTG TAC TAA AAA TGT 

1540 1550 1560 1570 1580 

* ■* * * * 

ATA CTT GTT GCC AGG AAG GIG ACC TCA AAA ATT AAA ACT ATA ATT ATT 

1590 1600 1610 1620 1630 

* * * * * 

CGG COG GGC ATG GTG GCT CAC ACC TGT AAT TCC AGC ACT TTG GGA GGC 

1640 1650 1660 1670 1680 

* * * * * 

CAA GGC AGG CGG ATC AGG AGG TCA GGA GTT CAA AAC CAG CCT GTC CAA 
TAT AGT G 

FIG. 3 (cont f d.) 
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